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NUCLEOTIDE SEQUENCES CODING FOR A HUMAN PROTEIN 
WITH ANGIOGENESIS REGULATIVE PROPERTIES 
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30 



This invention relates to nucleotide sequences 
coding for a human protein having angiogenesis regulative 
properties . 

More particularly, this invention relates to the 
isolation and to the molecular characterization of a gene 
coding for a new protein having the properties of an 
angiogenic factor which regulates ijn vivo the formation 
and/or the regeneration of the vertebrate blood vessel 
system, and it also relates to the protein itself. 
Moreover, this invention also refers to vectors 
containing such sequence or parts thereof, to prokaryotic 
and eukaryotic cells transformed with such vectors, and 
to the employment of such vectors and of such cells for 
the production of the protein and of corresponding 
polyclonal and/or monoclonal antibodies as well. 

It is well known that growth factors are 
polypeptides, synthesized and secreted by mammalian 
cells, capable of acting not only on the proliferation, 
but also on the differentiation and morphogenesis of 
target cells- Indeed, it has been shown that some growth 
factors exert their action by regulating mecnanisms such 
as chemiotaxis, activation of inflammatory system rells 
and repairing of tissues (Whitman, M. and Melton, D. A., 
19S9, Annual Rev. Cell Biol., 5 ? 93-117). 

Because of the similar phenotype between cultured 
growth factors stimulated and retrovirus transformed 
cells, it has been suggested that common mecnanisms 
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control such phenomena. Indeed, the interaction between a 
growth factor and its own specif ic ' receptor indirectly 
activates gene activity regulative proteins, through 
intermediate reactions involving different 

5 pro tei n— k inases . Many of the components of this metabolic 

chain have been identified as the cellular analogs of 
viral oncogenes, suggesting how oncoviruses could 
interfere with normal cellular processes- 
Many growth factors have been identified up to the 
10 present time, the corresponding genes have been cloned, 

and such factors have been divided into groups, on the 
basis of similar activities and/or of sequence 
homologies; among them there is the family of angiogenic 
f ac tor s . 

15 Ang i ogenesi s , or the formation of vessels of the 

vascular system, is a complex process occuring during 
embryogenesis , wound healing and organ regeneration. 
Moreover, some pathologies like the growth of solid 
tumors, some retinopathies and rheumatoid arthritis 

HO induce an aberrant angiogenesis (Risau W., 1990, Progress 

in Growth Factor Research, 2, 71-79). 

Angiogenesis i n v i vo is a multi-step process, two 
of them being represented by the migration and the 
proliferation of endothelial cells devoted to the 

25 formation of vessels. 

In the most recent years, many angiogenic factors 
have been identified, and the corresponding genes cloned. 
Among them: angiogenin, sub jec t— mat ter of the patent 
application PCT no. 8701372; the platelet-derived 

30 endothelial growth factor PD-ECGF (Ishigawa et al . , 1989, 
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Nature, 338, 557); the human vascular permeability 
factor, VPF (Keck et al., 1989, Science 2^6, 1309), which 
Mas cloned also in the mouse with the denomination of 
vascular endothelial growth factor, VEGF (Leung et al., 
5 1989, Science, 2^6, 1306); the growth factors for 

fibroblasts, i.e., the acid factor, a-FGF , and the basis 
factor, b-FGF, the transforming growth factors alpha, 
TGF _ #-5 anc j beta, TGF-. - (Folkman and Klagsburn, 1987, 

Science, 235, ^2 > . 

IQ Angiogenic factors have been divided into two 

groups, according to their way of actions either 
directly on the vascular endothelial cells, by 
stimulating motility or mitosis, or indirectly on cells 
producing growth factors acting on endothelial cells. 

!5 I_n vi tro analysis have put into evidence that 

angiogenic factors exert different effects on the 
motility and on the proliferation of endothelial cells. 
Indeed, some of them stimulate just one of the two 
events, other ones stimulate both events, whereas others 

20 seem to be ineffective in vitro , and , lastly, other ones 

show even an inhibiting activity of the endothelial 
cellular proliferation. Such data point out that the 
regulation of angiogenesis is a complex process mediated 
by diffrent components, many of which have not been 

25 identified as yet. 

Accordingly it is evident that there is the need 
for identifying and isolating new angiogenic factors 
capable of stimulating the migration and differentiation 
of endothelial cells, to be utilized both in the 

30 diagnostic field, as tumoral markers and for inflammatory 
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diseases, and in the therapeutical field, for topic or 
internal use, for instance in the treatment of wounds, of 
tissues after a surgical operation, of transplantation, 
of burns, ulcers, etc.. Such factors can be employed 
5 successfully also i n v i tro . as growth-stimulating of cell 

cul tures . 

Moreover, DNA recombinant techniques allow such 
factors to be produced in suitable amounts, in short 
times and at remarkably low costs. 

0 Indeed, there is an increasing need for identifying 

new specific tumoral markers because of uncertainties in 
tumor diagnosis. Moreover, recent methods for producing 
hybrid proteins (Fitzgerald D. and Pastan I., 1989, J. 
Natl. Cancer Inst. 81, 1^55—1^63) and/or conjugate 

5 antibodies (Pearson, J. W. et al., 1989 Cancer Res. *t9, 

3562-3567) with toxic molecules, are giving promising 
results in the field of tumoral serotherapy, with an 
increasingly growing demand for new factors to test. 
Finally, many of angiogenic factor genes have been cloned 

0 starting from tumoral cells, whereas a better 

applicability in the therapeutic field of genes coming 
from non-neop 1 as t i c material is evident. 

Accordingly, this invention provides nucleotide 
sequences coding for a protein having a regulative 

5 activity of ang iogenes i s , said sequences being obtained 

from non-neop 1 ast ic tissue; vectors containing said 
sequences; cells transformed by said vectors and 
producing protein having biologic and/or immunologic 
activities of a new angiogenic factor, as well as the 

O protein itself, to be employed in diagnostic and 
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therapeutic fields. 

This invention also provides a procedure for 
obtaining the protein, or parts thereof, by recombinant 
techniques, as well as its use as an antigen for the 
production of the corresponding polyclonal or monoclonal 

ant i bod i es . 

Indeed, molecular probes comprising sequences 
coding for the angiogenic factor subject-matter of the 
present invention can be employed as markers in the 
diagnosis of pathologies related to the aberrant 
production thereof, as the case of some tumoral 
pathologies for other angiogenic factors. 

Moreover, the protein which is another 

subject-matter of this invention can be employed in the 
treatment of inflammatory diseases, in the treatment of 
wounds, of tissues after surgical operations, of 
transplantation, of burns of ulcers and so on. Such 
factor can also be employed in vitro successfully, as 
growth stimulating of cell cultures. 

Finally, DNA recombinant techniques employed in the 
present invention allow to produce the molecular probes 
and proteins described above in suitable amounts, in 
short times and at remarkably reduced costs. 

The nucleotide and amino acids chains of this 
invention can be employed for diagnostic tests and for 
theraupetic purposes, both as directly derived from host 
cells and as after suitable modifications, for obtaining 
a better production for compositions. 

Accordingly, the object of this invention consists 
10 in nucleotide sequences coding for a pr o te i n , named P1GF, 



0 



15 
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with immunogenic and/or biologic properties of an 
angiogenesis regulative factor, having the amino acids 
sequence of SEQ ID Nl. 

As another embodiment of the invention, the P1GF 
amino acid sequence derives from alternative splicings of 
the primary transcript, preferably at the nucleotide 
sequence shown in SEQ ID N2 , most preferably giving rise 
to an amino acid insertion of 21 amino acids, whose 
sequence is shown in SEQ ID N2 , at position 14-1 — 14-2 of 
the amino acid sequence shown in SEQ ID Nl . 

An object of this invention also consists in 
nucleotide sequences coding for the P1GF protein, lacking 
and/or substituted in one or more amino acids, preferably 
deleted from the amino acid 1 to the amino acid 31 of SEQ 
ID Nl ; the present invention also provides nucleotide 
sequences which are allelic derivatives of the sequence 
coding for SEQ ID Nl , as well as nucleotide sequences 
that are complementary to those coding for SEQ ID Nl. 

Again according to the present invention the 
nucleotide sequence can be covalently bounded to a 
nucleotide sequence which can be translated into amino 
acid sequence by employing the same reading frame of the 
gene coding for P1BF, which preferably does not interfere 
with the angiogenesis regulative activity of P1GF, and 
which more preferably codes for a protein portion having 
toxic activity. 

Accordingly, the object of this invention also 
consists in the nucleotide sequence of SEQ ID Nl , even 
though the same is lacking and/or substituted in one or 
more nucleotides, coding at its coding part for the 
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protein P1GF, as in SEQ ID Nl . 

The present invention also provides nucleotide 
sequences hybridizing with SEQ ID Nl , or parts thereof; 
nucleotide sequences obtained both through natural and 
5 synthetic or semisynthetic methods, by substitution, 

deletion, insertion and inversion mutations, either 
concerning single bases or multiple bases, of sequence 
described in SEQ ID Nl , or parts thereof; and nucleotide 
sequences comprising sequences coding for a protein 

10 having immunogenic and/or biologic properties similar to 

those exhibited by the protein P1GF or parts thereof. 

A further aspect of this invention relates to the 
protein P1GF having the sequence disclosed in SEQ ID Nl , 
or parts thereof, obtained either by means of recombinant 

15 DNA techniques or isolated from biologic tissues. Said 

protein, or parts thereof which are immunologically 
active, can be employed as antigenes for producing 
polyclonal and/or monoclonal antibodies. 

The present invention also provides cloning and/or 

20 expression vectors, both prokaryotic and eukaryotic, 

comprising the nucleotide sequences subject-matter of the 
invention, sequences promoting transcription located 
upstream and, in general, a selective marker. Preferably, 
sequences promoting transcription in an inducible manner, 

25 can also be present and enhancers, po 1 yadeny 1 a t ion 

signals and so on, as well. 

Again an object of the present invention consists 
in prokaryotic and eukaryotic cells transformed by said 
vectors to be employed for producing the P1GF protein or 

30 parts therof • 
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Just for illustrative and not for limitative 
purposes the present invention will be described in the 
following examples. In what follows reference will be 
made to the enclosed Figures wherein: 
5 Figure 1 represents the restriction map of the 

recombinant A GT11 phage, comprising the sub 32 fragment; 

Figure 2 represents a "Northern blot*' experiment 
employing the sub 32 cDNA fragment; 

Figure 3 represents the restriction map of the 
0 plasmid pPlGF-2; 

Figure represents an exemplificative scheme of 

subcloning of a fragment coding for a portion of the 
protein P1GF in the expression vector pET3 (Novagen, 
Madison WI , USA) ; 

5 Figure 5 represents a po lyacry lamide gel 

electrophoresis of the protein P1GF, said protein being 
obtained through the recombinant way. 

Example 1 

I so 1 at ion of the cDNA cod inq for a new ana iooen i c factor 
0 A first cDNA fragment, named sub 32 , was isolated 

from a clone of a cDNA library from human placenta, in 
the A GT11 vector, according to conventional procedures 
and employed also in other laboratories (Wataneb et al . , 
J. Biol. Chem. 26^, 12611-19, 19B9). 
5 Briefly, RNA was extracted through lysis with 

guanidine thiocyanate and centrifuging on a discontinuous 
gradient of caesium (Sambrook J., Fritsch E.F., Maniatis 
T-, Molecular Cloning - A Laboratory Manual. Second 
edition, Vol. 1, 7.19. Cold Spring Harbor Lab. Press). 
0 The poly A-»- RNA was purified through chromatography on 
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oligo-dT cellulose (ibid. 7.26). The cDNA synthesis and 
the cloning of the A GT11 phage vector (Stratagene, La 
Jolla California, USA) in the Eco Rl restriction site was 
carried out following the protocol described ibid. 
5 8.5*f-B.79. A clone, whose map is shown in Figure 1, was 

identified because of comprising also a sequence of 2600 
nucleotides capable of hybridizing, in 5x SSC at 65C 
according to the hybridization procedure on filters 
described ibid. B.4-6, with a sequence coding for the cDNA 

10 of the glucose-fa-phosphate dehydrogenase enzyme <G6PD> 

(Persico, M . et al. , 1986, Nuc 1 . Acid Res., l<t, 2511). A 
fragment of 2^0 nucleotides was also isolated from this 
recombinant phage after digestion with Eco Rl and Bam HI, 
and the fragment was called sub 32. Said fragment, after 

15 labelling with 32P by means of the "nick translation" 

procedure disclosed ibid. 10.6-10. B was employed for: 

a> analyzing RNAs extracted from different tissues 
or cell lines, by "Northern blot*' procedure as descrioed 
in ibid. 7.37. The results shown in Figure 2 show that 

20 the sub 32 fragment detects specific mRNA in the placenta 

(line 2), in HEPG2 hepatoma cells, ATCC N . HB8065 (line 
3), in JEG human choriocarcinoma cells, ATCC N. HTB36 
(line *f) and, at lower concentration, in Hela 53 cells, 
ATCC N. CCL2.2 (line 5), but not in HL60 cells, ATCC N. 

25 CCL2<fO (line 1); 

b) screening a cDNA library from JEG human 
choriocarcinoma JEG, ATCC N. HTB36, according to the 
procedures described for the cDNA library from human 
placenta, in the /\ GT10 vector (Stratagene, La Jolla 

30 California, USA), in the Eco Rl site. Two clones were 
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isolated, digested with Eco Rl and subcloned in the pUC 
18 vector (Stratagene, La Jolla, California, USA> and the 
sequence determined by Sanger •s method (ibid. 
13.6—13.10)- The sequence revealed the fragmnents 

5 overlapped partially one another, but did not comprise 

the whole sequence coding for the corresponding mRNA. 
Hence, the isolated fragments were employed for a second 
screening, employing the same techniques. The library 
employed was the cDNA library from human placenta, from 

O which the initial sub 32 fragment came from. Then two 

clones were isolated, their DNA was digested with Eco Rl , 
the resulting inserts were subcloned in the pGEM 1 vector 
(Promega Corporation, Madison WI . , USA) and their 
sequence was determined by Sanger's method. The two DNA 

5 fragments obtained after digestion with Eco Rl were 

religated together through T^-ligase and cloned in the 
same pGEM 1 vector in the Eco Rl site, to obtain the 
whole cDNA sequence corresponding to the mRNA present in 
the placenta, in a single plasmid, called pPlGF-£ ( ATCC 

O Dep. No. <fOB92), whose map is shown in Figure 3. 

In order to confirm that the resulting fragment 
covers the whole coding sequence, the sequence was 
compared with the sequence of a genoma fragment obtained 
after hybridization of the same fragment with a genomic 

15 library from human fibroblasts WI3B (No. 9*+*t201 

Stratagene, La Jolla, California, USA) in the a Fix 
vec tor . 

The cDNA sequence was identified according to 
Sanger's method <ibid. 13.3-13.10) and revealed: 
(0 a) a 5' end untranslated region of 321 nucleotides 



WO 92/06194 PCT/IT91/00079 

- 11 - 

comprising a sequence capable of forming a stem-loop 

secondary structure, indicative of a translation 

regulative signal ; 

b> a sequence of ^7 nucleotides with an open 
5 reading frame coding for a protein of 1<*9 amino acids, 

comprising a hydrophobic sequence of 32 amino acids at 
the NHa-Lprminal, indicative of the signal peptide of 
secreted proteins; 

c) a 3* end untranslated region of 877 nucleotides 

10 comprising a po lyadeny lat i on site. 

The amino acid sequence, deduced by the cDNA 
sequence, was inserted into the European Molecular 
Biology Laboratory < EMBL > Data Bank, showing no protein 
with the same sequence. A 507. homology, limited to a ISO 

15 amino acid region, was shown with the vascular 

permeability factor VPF (Keck et al., 19B9, Science, 2^6, 
1309), a powerful angiogenic factor, thus suggesting that 
the new protein P1GF can have by itself an 
ang iogenes i s-regul a t ing activity. 

20 Example 2 

Screening of a cDNA 1 ibrary from JEG-3 eel Is wi th pPLGF 
ana struc ture of P 1 GF-qene 

A cDNA library, obtained from JEG-3 cell mRNA , was 
screened with the P1GF probe. Six recombinant phages were 

25 isolated- The sequence of two of them revealed they have 

a lenght of 510 bp, generating a 170 amino acid protein. 
The sequence resulted to be identical to the cDNA 
isolated from placenta, but an insertion of 63 bp, 
generating a 21 amino acid insertion into the protein, at 

30 position 1^1-142. Interestingly, the new sequence 
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contains 10 basic amino acids (Arg and Lys ) over Si. 

Example 3 

enomic mapping and cloning of the P1GF gene ? 
The gene coding for the protein P1GF Mas mapped on 
5 the chromosome 1 *f through "Southern blot" analysis, by 

employing DNA from different hybrid cellular lines, each 
containing different human chromosomes (not shown). 

The structure and part of the nucleotide sequence 
of the P1GF gene was determined from a human genomic 
10 library- The gene is divided into six exons and five 

intervening sequences generating, through splicing, the 
transcript coding for the 1^9 aa. protein. In 
choriocarcinoma cells (JEG— 3> the primary transcript is 
alternatively spliced at the fifth intron to generate a 
15 transcript coding for the 170 aa . (see SEQ ID NE ) . 

Another alternative splicing involving the sequence from 
17<t to B2B of SEQ ID N2 of the fifth intron, gives rise 
tc an higher molecular weigth P1GF protein. In fact two 
proteins are i mmunoprec ip i t a ted from JEG— 3 conditioned 
SO medium, with antibodies anti P1GF. 

Example 

SuOc loning of P1GF cDNA i n a prokaryot i c express i on 
vector 

A scheme of the subcloning strategy is shown in 
£5 Figure *t , wherein the pET3 vector was employed (Novagen 

Madison Wi ; USA) containing essentially the T7 phage RNA 
polymerase promoter, the terminator of the same phage, an * 
origin of the replication (ori) and the resistance to 
ampicillin (amp). 

30 The cDNA insert to be subcloned was obtained 
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through PCR amplification (polymerase chain reaction, 
ibid. 14.6), generating a cDNA coding for the protein 
lacking the first 31 amino acids- As template, the Eco 
Rl DNA fragment, from nucleotide 1 to nucleotide 940, was 
employed. As primers for RNA polymerase the following 
oligonucleotides were employed, synthesized with an 
"Applied Biosystem 381A M o 1 igo-synthes i zer : 

- oligonucleotide A complementary to the coding strand 
from the nucleotide 768 to the nucleotide 7B7 , in which 
the GGATCC sequence, Bam HI recognition site, was 
inserted between nucleotides 775 and 776, having the 

following sequence: 

5 ' -TCCTCCAAGGGGATCCTGGGTTAC-3 ' 

BamHl 

- ol igonuceot ide B complementary to the non-coding strand 
from nucleotide 404 to nucleotide 431, in which the 
CATATG sequence, Nde 1 recognition site, was inserted 
between the nucleotides 414 and 415. having the following 

sequence : 

3 ' — GCAAGGG GTATACT CGTCTGTTCC—5 ' 

Ndel 

The nucleotide chain, ootained from PCR, was 
digested with Nde 1 and Bam HI and 1 igated with the 
prokaryotic pET3 expression vector in the same Nde 1 and 
Bam HI sites according to standard protocols. The 
product was employed for transforming the E.coli HB101 
strain which had been made competent with the CaC12 
method. The recombinant plasmid was identified and 
employed for transforming the E.coli JMIO 5 ? strain CDE3. 
Promega Corporation, Madison WI , USA). 
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Example 5 

Svnthes i s and iso 1 at ion of the P1GF pro tei n from bacter i a 
A single colony was inoculated in L3 broth 
containing 100 ^ig/ml of amplicillin (Sigma, St. Louis 
5 MO., USA) and g/1 of glucose and then grown at 37C to 

reach an optical density O.D. of 0.35 at 600 nm. IPTG 
(Sigma) was added to a 1 mM final concentration and the 
culture was incubated at 37C for additional 3 hours- The 
culture was centrifuged and resuspended in l/lO of the 

lO initial volume of a buffer containing 10 mM Tris-HCl, 1 

mM EDTA , pH B.O ( TE ) . Following further centr if ugat ion , 
the precipitate was resuspended in 1/60 of the initial 
volume into a lysis buffer containing TE, IV. SD5 , 0.1 M 
NaCl . Bacteria were divided into aliquots of 500 ul and 

15 subjected to lysis by three cycles- of freezing and 

thawing, followed by middle strength sonication. 

An example of the resulting e 1 ec tr ophor et ic pattern 
is shown in Figure 5, wherein lines 1, 2, 3 and 
represent el ec tropnoret ic patterns of oroteins from 

20 lysates respectively 0, 1, 2 and 3 hours after IPTG 

induction. As control, line 5 represents the same strain 
transformed only with the vector lacking the insert, 
induced with IPTG for 3 hours. Electrophoresis was 
carried out according to Laemli, Nature, 227, b80— 685, 

25 1970, in a 15V» po 1 yacr y 1 am i de gel stained according to 

the method described by Bradley et al . , Anal* Biochem. 
182, 157-159 (1989). 

Example 6 

Product ion of an t i P 1 GF a nt lbod ies and 

30 immunoprec io i tat ions of P IGF 
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70 ug of the protein P1GF was employed for 
immunizing two chickens, as described by Gassmann et al . , 
1990 Faseb J- 2528-2532. The antibodies so formed were 

extracted and purified from the yolk througn 
5 precipitation with polyethylene glycol (PEG) as described 

by Gassmann et al- <cf. above). The i mmunopr ec i p i t a t i ons 
were performed by incubating 120-250 pi of cellular 
lysate, or Cos-1 cell conditioned medium, with 10 or 15 
ul of rabbit or chicken antibodies, for 2 hours at room 
10 temperature, or 16 hours at 4C. The immunor eac t i ons with 

chicken antibodies were further treated with 15 ul of 
rabbit anti-chicken IgG (SIGMA N . C6778). for 1 hour at 

room temperature. 

The immunocomplex was selectioned through 

15 protein-Sepharose <tB (Pharmacia) and washed twice with 

1.2 ul of PBS with 0.017. Nonidet-P^O and ^OO uM of NaCl. 

The immunoprec ip i tates were then resuspended and 
analyzed on po 1 yacr y 1 am i de gel under denaturing and 
reducing conditions according to standard orocedure. If 

20 COS cells had been previously transfected with the 

plasmid pSVL-PlGF, a protein of 25 KDa molecular weight 
is immunoprecipi tated , both from the lysate and from the 
culture medium. 
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SPECIMEN SEQUENCE LISTING 

SEQ ID N. 1 

SEQUENCE TYPE: nucleotide with corresponding 

protein at the coding region 
SEQUENCE LENGTH: 1645 bp 
STRANDEDNESS : s i ng 1 e 
TOPOLOGY: linear 
MOLECULE TYPE: cDNA to mRNA 
HYPOTHETICAL SEQUENCE: no 
ANT I -SENSE: no 
ORIGINAL SOURCE: cDNA placental library 
ORGANISM: human 

IMMEDIATE EXPERIMENTAL SOURCE: pPlGF-2 (ATCC 

N.4089E) 

15 FEATURES: 

from 1 to 321 bp 5' end untranslated region 

from 322 to 768 bp coding region 

from 769 to 1645 bp 3' end untranslated region 

PROPERTIES: 



10 



20 



10 20 30 40 50 60 

GGGATTCGGGCCGCCCAGCTACGGGAGGACCTGGAGTGGCACTGGGCGCCCGACGGACCA 



70 80 90 100 110 120 

TCCCCGGGACCCGCCTGCCCCTCGGCGCCCCGCCCCGCCGGGCCGCTCCCCGTCGGGTTC 



130 140 150 160 170 180 

25 CCCAGCCACAGCCTTACCTACGGGCTCCTGACTCCGCAAGGCTTCCAGAAGATGCTCGAA 



190 200 210 220 230 240 

CCACCGGCCGGGGCCTCGGGGCAGCAGTGAGGGAGGCGTCCAGCCCCCCACTCAGCTCTT 
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250 260 270 280 290 300 

CTCCTCCTGTGCCAGGGGCTCCCCGGGGGATGAGCATGGTGGTTTTCCCTCGGAGCCCCC 



310 320 330 340 350 360 

TGGCTCGGGACGTCTGAGAAGATGCCGGTCATGAGGCTGTTCCCTTGCTTCCTGCAGCTC 
5 MetProValMetArgLeuPheProCysPheLeuGlnLeu 

370 380 390 400 410 420 

CTGGCCGGGCTGGCGCTGCCTGCTGTGCCCCCCCAGCAGTGGGCCTTGTCTGCTGGGAAC 
LeuAlaGlyLenAlaLeuProAlaValProProGlnGlnTrpAlaLeuSerAlaGlyAsn 

430 440 450 460 470 480 

10 GGCTCGTCAGAGGTGGAAGTGGTACCCTTCCAGGAAGTGTGGGGCCGCAGCTACTGCCGG 

GlySerSerGluValGluValValProPheGlnGluValTrpGlyArgSerTyrCysArg 

490 500 510 520 530 540 

GCGCTGGAGAGGCTGGTGGACGTCGTGTCCGAGTACCCCAGCGAGGTGGAGCACATGTTC 
AlaLeuGluArgLeuValAspValValSerGluTyrProSerGluValGluHisMetPhe 

15 550 560 570 580 590 600 

AGCCCATCCTGTGTCTCCCTGCTGCGCTGCACCGGCTGCTGCGGCGATGAGAATCTGCAC 
SerProSerCysValSerLeuLeuArgCysThrGlyCysCysGlyAspGluAsnLeuHis 

610 620 630 640 650 660 

TGTGTGCCGGTGGAGACGGCCAATGTCACCATGCAGCTCCTAAAGATCCGTTCTGGGGAC 
20 CysValProValGluThrAlaAsnValThrMetGlnLeuLeuLysIleArgSerGlyAsp 

670 680 690 700 710 720 

CGGCCCTCCTACGTGGAGCTGACGTTCTCTCAGCACGTTCGCTGCGAATGCCGGCCTCTG 
ArgProSerTyrValGluLeuThrPheSerGlnHisValArgCysGluCysArgProLeu 

730 740 750 760 770 780 

25 CGGGAGAAGATGAAGCCGGAAAGGTGCGGCGATGCTGTTCCCCGGAGGTAACCCACCCCT 

ArgGluLysMetLysProGluArgCysGlyAspAlaValProArgArg 

790 800 810 820 830 840 

TGGAGGAGAGAGACCCCGCACCCGGCTCGTGTATTTATTACCGTCACACTCTTCAGTGAC 



850 860 870 880 890 900 

30 TCCTGCTGGTACCTGCCCTCTATTTATTAGCCAACTGTTTCCCTGCTGAATGCCTCGCTC 

910 920 930 940 950 960 

CCTTCAAGACGAGGGGCAGGGAAGGACAGGACCCTCAGGAATTCAGTGCCTTCAACAACG 



970 980 990 1000 1010 1020 

TGAGAGAAAGAGAGAAGCCAGCCACAGACCCCTGGGAGCTTCCGCTTTGAAAGAAGCAAG 
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1030 1040 1050 1060 1070 1080 

ACACGTGGCCTCGTGAGGGGCAAGCTAGGCCCCAGAGGCCCTGGAGGTCTCCAGGGGCCT 



1090 1100 1110 1120 1130 1140 

GCAGAAGGAAAGAAGGGGGCCCTGCTACCTGTTCTTGGGCCTCAGGCTCTGCACAGACAA 



1150 1160 1170 1180 1190 1200 

GCAGCCCTTGCTTTCGGAGCTCCTGTCCAAAGTAGGGATGCGGATTCTGCTGGGGCCGCC 



1210 1220 1230 1240 1250 1260 

ACGGCCTGGTGGTGGGAAGGCCGGCAGCGGGCGGAGGGGATTCAGCCACTTCCCCCTCTT 



1270 1280 1290 1300 1310 1320 

rrTCTGAAGATCAGAACATTCAGCTCTGGAGAACAGTGGTTGCCTGGGGGCTTTTGCCAC 



1330 1340 1350 1360 1370 1380 

TCCTTGTCCCCCGTGATCTCCCCTCACACTTTGCCATTTGCTTGTACTGGGACATTGTTC 



1390 1400 1410 1420 1430 1440 

TTTCCGGCCGAGGTGCCACCACCCTGCCCCCACTAAGAGACACATACAGAGTGGGCCCCG 



1450 1460 1470 1480 1490 1500 

GGCTGGAGAAAGAGCTGCCTGGATGAGAAACAGCTCAGCCAGTGGGGATGAGGTCACCAG 



1510 1520 1530 1540 1550 1560 

GGGAGGAGCCTGTGCGTCCCAGCTGAAGGCAGTGGCAGGGGAGCAGGTTCCCCAAGGGCC 



1570 1580 15$0 1600 1610 1620 

CTGGCACCCCCACAAGCTGTCCCTGCAGGGCCATCTGACTGCCAAGCCAGATTCTCTTGA 



1630 1640 
ATAAAGTATTCTAGTGTGGAAACGC 



WO 92/06194 



PCT/IT91/00079 



- 19 - 

SEQ ID N.2 

SEQUENCE TYPE: nucleotide with the corresponding 

aminoacid sequence for a reading frame 

SEQUENCE LENGTH: BEB bp 
5 STRANDEDNESS: single 

TOPOLOGY: linear 

MOLECULE TYPE: genomic DNA 

HYPOTHETICAL SEQUENCE: no 

ANT I -SENSE: no 
10 ORIGINAL SOURCE: genomic library 

ORGANISM: human 

IMMEDIATE EXPERIMENTAL SOURCE: plqfis5 
FEATURES: 

from 1 to BE8 bp 5* end of the fifth 

-|g intervening sequence of the P1GF gene 

from 110 to 17E bp coding region due an 
alternative splicing of the primary transcript 
from 175 to 828 bp coding region aue to another 
alternative splicing of the primary transcript 

20 PROPERTIES: 

10 20 30 40 50 60 

GTAAGTGGTTTGGCTGGGGCTCGGGGCTATTCTATTCTCGGGCCTGCCAGCCTCTGTCCT 

70 80 90 100 110 120 

AGCATGGGGTTCCCCAGCCACCTTGTCCTGACGCTTGGCTTATTGCAGGAGGAGACCCAA 

ArgArgProLy 

25 130 140 150 160 170 180 

GGGCAGGGGGAAGAGGAGGAGAGAGAAGCAGAGACCCACAGACTGCCACCTGTGAGTGCG 
sGlyArgGlyLysArgArgArgGluLysGlnArgProThrAspCysHisLeu 
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190 200 210 220 230 240 

CGGGGTCCCAGGGATGGCGAGGAGGCTGGGCCCGAGGGGAGCCCCGCCTTGCCGCCAGGG 



250 260 270 280 290 300 

TTAGGTTGGGGAGGGGGAGAGGCAGGACTGAGGCCAGTCTTGGGG ( G ) CAG AACAGGGAN 



310 320 330 340 350 360 

CTGCACCTCCTCAAGACTCTAGGGCCCAGGAAGCATCAGTGGACCTTGGTTTTTATCCCG 



370 380 390 400 410 420 

GCTTAGCCTAGGTTTCCATTGACCTTCAACAAATCATTTCACCTTTGTCAGCCTAGCTTT 



430 440 450 460 470 480 

10 TCTCTGTGTAGAATGAGGGGCAGGAGGTCCAGCAAACATTCAGTCACTCTACAAACATTT 



490 500 510 520 530 540 

AC TG AG C A CTT ACTGTGTGT C AGGT AC AT CTGTG AGC AAAC AAAC AGG AT TC CTG C ACAT 



550 560 570 580 590 600 

TAGTGTTTACCTTTTAGTGATTAAAAGTCTGTCATCAGCTGAGACGTTATCTGGGGCCAC 



15 610 620 630 640 650 660 

TTCCTAGTAGCCCGGGGAACATGTGCCCTCNCACTGTCTCCCAGGAGTATTTTTGCCTGT 



670 680 690 700 710 720 

GGGTCCCCTTGCTGCTTCTAACCCACTTCGTACCTTGTGGGCAGCAGAATGGAGCCCCAG 



730 740 750 760 770 780 

20 GCCTGAGTGTGGCTGGGAGAGAAGGATGAGAGGAGGGAAAACCCAAATCTGTGAGAGTAA 



790 800 810 820 

ATAGAAAAAATAAAATATTTCACGTGCACAGTCAATCAGTCACTGAAG 
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CLAIMS 

1. A nucleotide sequence comprising a sequence 
coding for a protein with angiogenesis regulative 
activity having the sequence of SEQ ID Nl . 
5 S. A nucleotide sequence according to claim 1 

wherein said coded protein derives -from alternative 
splicings, different from those generating the sequence 
protein of SEQ ID Nl . 

3. A nucleotide sequence according to claim 2 
10 wherein said alternative splicing generates a transcript 

coding for a protein with SEQ ID Nl , with an insertion of 
21 aa., as in SEQ ID N2 , at position 1<*1-1<*S of SEQ ID 
Nl . 

<t. A nucleotide sequence according to any of the 
15 previous claims, wherein said protein lacks one or more 

amino acids. 

5. A nucleotide sequence according to any of the 
previous claims, wherein said protein has an altered 
regulative activity of angiogenesis. 
20 6. A nucleotide sequence according to claims or 5 

wherein said protein is deleted from the amino acid 1 to 

the amino acid 31. 

7. A nucleotide sequence according to any of claims 
1 to 3 wherein one or more amino acids have been 

25 substituted in said protein. 

8. A nucleotide sequence according to claim 7, 
wherein said protein shows an altered regulative activity 

of angiogenesis. 

9. A nucleotide sequence according to any of the 
30 previous -laims wherein said protein is an allelic 
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derivative. 

10. A nucleotide sequence complementary to a 
sequence according to any of the previous claims. 

11. A nucleotide sequence according to any of the 
5 previous claims, to which a nucleotide sequence, that can 

be translated with the same reading frame, has been 
covalently linked to the 5' or to the 3 1 positions. 

12. A nucleotide sequence according to claim 11 f 
wherein said linked nucleotide sequence codes for an 

10 amino acid sequence which does not interfere with the 

regulative activity of angiogenesis of the protein. 

13. A nucleotide sequence according to claims 11 or 
12 wherein said linked nucleotide sequence codes for a 
protein having a toxic activity. 

15 1^- A nucleotide sequence comprising the sequence of 

SEQ ID Nl, wherein the 5 ' end region from the nucleotide 1 
to the nucleotide 321 is untranslated, the region from 
the nucleotide 322 to the nucleotide 768 codes for a 
protein with angiogenesis regulative activity, the 3' end 

HO region is untranslated. 

15. A nucleotide sequence according to claim l*t 
comprising at least a part of sequence of SEQ ID N2 . 

16. A nucleotide sequence according to claim l*f or 
15 lacking one or more nucleotides, coding for a protein 

25 having a regulative activity of angiogenesis. 

17. A nucleotide sequence according to claims 1^ or 
15 lacking one or more nucleotides, coding for a protein 
having an altered regulative activity of angiogenesis. 

18. A nucleotide sequence according to claims 16 or 
17 which is deleted from the nucleotide 322 to the 



30 
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nucleotide 414. 

19. A nucleotide sequence according to claims 14 or 
15 wherein one or more nucleotides have been substituted, 
coding for a protein having a regulative activity of 

5 ang iogenesis • 

20. A nucleotide sequence according to claims 14 or 
15 wherein one or more nucleotides have been substituted, 
coding for a protein with an altered regulative activity 

of ang iogenes i s . 

El. A nucleotide sequence according to claims 14 or 
15, wherein said untranslated 5' region comprises a 
regulative region of the translation. 

22. A nucleotide sequence according to claim El, 
wherein said regulative region forms a stem-loop 
15 secondary structure. 

£3. A nucleotide sequence which is allelic to the 
nucleotide sequence according to claims 14 or 15. 

24. A nucleotide sequence which is complementary to 
a nucleotide sequence according to any of the previous 

20 claims from 14 to 23. 

E5. A protein comprising the amino acid sequence of 
SEQ ID Nl having a regulative activity of ang i ogenes i s . 

36. A protein according to claim E5 wherein said 
protein comprises an insertion derived from alternative 
25 splicing of the primary transcript. 

E7- A protein according to claim 26 wherein said 
insertion is at position 141-142 of SEQ ID Nl and 
comprises a sequence coded by at least a portion of SEQ 
ID N2. 

30 28. A protein according to claim 27 wherein said 
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sequence is of 21 amino acids as in SEQ ID N2. 

29. A protein according to any of claims 22 from 25 
to 28 wherein said protein lacks one or more amino acids. 

30. A protein according to claim 29, wherein said 
5 protein shows an altered regulative activity of 

ang iogenes is . 

31. A protein according to claims 29 or 30, wherein 
said protein is deleted from the amino acid 1 to the 
amino ac id 31 . 

10 32. A protein according to any of claims from 25 to 

28 wherein one or more amino acids have been substituted 
in said protein. 

33- A protein according to claim 32 wherein said 
protein shows an altered regulative activity of 

15 angiogenesis 

wh i ch 

is an allelic derivative of the nucleotide sequence 
according to any of claims from 1 to 3 . 

35. A protein comprising an amino acid sequence 
20 according to anyone of the previous claims from 25 to 3**, 

to which an amino acid chain is covalently linked at the 
terminal CQOH or NH2 groups. 

36. A protein according to claim 35, wherein said 
amino acid chain does not interfere with the regulative 

25 activity of the angiogenesis of the protein itself. 

37. A protein according to any of claims 35 or 36, 
wherein said amino acid chain shows a toxic cell 
act i v i ty . 

38. A vector comprising: a) the replication origin 
30 of a bacterial pi asm id, b) a selective marker, c> a 



3^+. A protein coded by a nucleotide sequence 
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promoter and under th« control of said promoter, d) the 
nucleotide sequence according to any of the previous 

claims from 1 to 24 . 

39. ft vector according to claim 3B, wherein said 

5 selective marker is a gene coding for an antibiotic 

res i s tance . 

40. A vector according to any of claims 38 or 39, 
wherein said promoter is T7 phage RNA polymerase 
promo ter . 

1Q m ft vector according to any of claims from 3B to 

40 wherein said vector comprises the plasmid pGem 1. 

42. A vector according to any of claims from 3B to 
40 wherein said vector comprises the plasmid pET3. 

43- A vector according to any of claims from 38 to 
!5 comprising one or more sequences that regulate the 

transcription. 

44. A vector according to claim 43, wherein said 
sequences comprise a transcription enhancer. 

45. A vector according to claim 43 wherein said 
20 sequences comprise an inducible promoter. 

46. A vector according to any of claims from 38 to 
45 comprising a po lyadeny lat ion site. 

47. Cells transformed with a vector according to any 

of claims from 38 to 46. 
25 ^8. Cells according to claim 47 wherein said cells 

are bacterial cells. 

49. Cells according to claim 48 wherein said 

bacteria are E.coli. 

50. Cells according to claim 47 wherein said cells 
30 are eukaryotic cells. 
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51. A process for the production and extraction of 
the protein P1GF from cells according to any of claims 
from 4-7 to *f 9 , said process comprising: growing the 
bacterial culture in liquid medium to an optical density 
5 between 0.2 and resuspending in TE; centrifuging and 

resuspending in a lysis buffer; lysing the cells. 
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